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Abstract 

The card-cyclic-to-random shuffle is the card shuffle where the n cards 
are labeled I,... ,n according to their starting positions. Then the cards are 
mixed by first picking card 1 from the deck and reinserting it at a uniformly 
random position, then repeating for card 2, then for card 3 and so on until all 
cards have been reinserted in this way. Then the procedure starts over again, 
by first picking the card with label 1 and reinserting, and so on. Morris, Ning 
and Peres Q recently showed that the order of the number of shuffles needed 
to mix the deck in this way is n log n. In the present paper, we consider 
a variant of this shuffle with relabeling, i.e. a shuffle that differs from the 
above in that after one round, i.e. after all cards have been reinserted once, 
we relabel the cards according to the positions in the deck that they now 
have. The relabeling is then repeated after each round of shuffling. It is 
shown that even in this case, the correct order of mixing is n log n. 
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1 Introduction 


The subject of mixing times for Markov chains an important and exciting research 
field that has attracted a lot of attention in recent decades. An outstanding subclass 
of Markov chains that has been studied extensively is card shuffling, i.e. Markov 
chains on the symmetric group of permutations of n items that one can think 
of as the cards of a deck. 

One of the early card shuffles to be studied was the random transpositions 
shuffle, where each step of the shuffle is made by picking two cards uniformly 
and independently at random and then swapping them. It was shown by Diaconis 
and Shahshahani Q that the mixing time of this shuffle has a sharp threshold at 

log n shuffles. It is easy to see that at least order of n log n shuffles is required, 
since, by the coupon collector’s problem, it takes this order of shuffles until most 
cards have been touched at all. Closely related to the random transpositions shuffle 
is the top-to-random shuffle where at each step the card presently in position one 
is moved to a uniform random position. The sharp threshold for this shuffle is 
n log n and again it is easy to see that at least order of n log n steps is required for 
mixing, for similar reasons. 

In recent years some more systematic variants of these shuffles have been pro¬ 
posed and analyzed. Mossel, Peres and Sinclair dU and Saloff-Coste and Zuniga 
[|6ll analyzed the cyclic-to-random shuffle, where at time t the card presently in 
position t mod n is swapped with a uniformly random card. Clearly at least once 
per n steps, each card will be touched and one of the interesting questions about 
this shuffle was if 0{n) shuffles is also sufficient to mix the whole deck. The 
answer turns out to be negative; indeed the mixing time is still of order nlogn. 
Pinsky [j5l later introduced the card-cyclic-to-random transpositions shuffle (CCR 
shuffle), where at time t the card with label t mod n (i.e. the card that started 
out in position t mod n) is moved to a uniformly random position. Again it is 
obvious that every card will be touched once every n steps and again one main 
question was if this way of systematically randomizing the cards, suffices to mix 
the whole deck in 0{n), or at least o{n logn), steps. Again the answer turns out 
to be negative; Morris, Ning and Peres [l3]| prove that n log n is still the correct 
order. In this paper we investigate the card-cyclic-to-random shuffle with relabel¬ 
ing (the CCRR shuffle for short). For k = 1, 2,... let round k consist of steps 
fcn + 1, fcn + 2,..., fcn + n of shuffling. The CCRR shuffle is the shuffle that is 
exactly as the card-cyclic-to-random shuffle for the first round. After that how¬ 
ever, the cards are relabeled 1,..., n according to their positions after the first 
round. Next a new round of CCR shuffling is carried out according to the new 
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labels. After that the cards are relabeled again and a new round of CCR is done, 
and so on. The main result of this paper is that relabeling does not help to speed 
up mixing either, at least not more than by a constant. 

Theorem 1.1 The mixing time of the card-cyclic-to-random transpositions with 
relabeling is of order n log n. 

Here, the mixing time is given by 



where Xt G is the state of the deck of cards after t steps of shuffling, vr is the 
uniform distribution on and || ■ Htv is the total variation norm, given in general 
by 



for a signed measure /i on a finite space S. 

2 Proof of the main result 

For the upper bound on Tmix, it suffices to note that the proof in [l3]| for the CCR 
shuffle goes through exactly as it stands there. Hence we will focus entirely on 
the lower bound. The idea of the proof of the lower bound draws on the idea 
behind Wilson’s technique introduced in [[81 and [[91, namely to use an eigenvector 
of the transition matrix for the movement of a single card to build a test function. 
However since estimating the variance of the test function will in fact be quite 
simple here, we will not need Wilson’s Lemma explicitly. A rough outline of the 
proof is 

1. Show that the position of a given card after one round of CCRR is deter¬ 
mined, up to a random term of order ^/n, by where it was reinserted. 

2. In the light of 1, study the idealized motion of a single card which is a 
deterministic function of where it was reinserted. 

3. Show that the transition matrix for one round of idealized single card motion 
has a spectral gap bounded away from 1. 
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4. Use the eigenvector corresponding to the second eigenvalue to construct a 
test statistic. 


5. Estimate, using 1, the expectation and variance of the test statistic applied 
to the CCRR shuffle and establish the lower bound using Chebyshev’s in¬ 
equality. 

Because of the cyclic structure of the shuffle, the movement of a single card 
is not time-homogenous if we consider individual steps of the shuffle. However 
in terms of rounds, the movement of a given card is indeed a time-homogenous 
Markov chain. Let A = A{n) denote the transition matrix of this chain on n cards. 
It turns out that when analyzing this Markov chain, it is convenient to denote the 
possible positions a card can have in the deck as 1/n, 2/n, 3/n,..., 1 (instead 
of the usual 1,2,... ,n). Write Qn '■= [n]/n = {1/n, 2/n, .. ., 1} for the set of 
positions. As usual, we will identify a card with its starting position, i.e. when 
we speak of card a, a G Qn, we are considering the card that starts in position a. 
Since this is the na’th card from the top in the starting order of the deck, we may 
sometimes also speak of this card as card na. 

It is difficult to come up with a closed-form expression for A, but the action of 
A can be probabilistically described as follows. Consider a card that starts a round 
in position a G Qn- Let us refer to the cards 1/n, ...,a — 1/nas white cards and 
to the cards a -f 1/n,..., 1 as black cards. Now in a first stage the na — 1 white 
cards are sequentially picked out and reinserted at independent uniform positions. 
During this stage a certain number of cards will be reinserted above card a in the 
deck whereas the others will be uniformly spread out among the black cards below 
card a. The cards that in this stage end up above card a will form a well-mixed 
layer of white cards. Note that during stage 1, card a will move gradually higher 
up in the deck. (Here we say that if a < 6, then position a is higher up than, or 
above, position b.) 

Next, after stage 1, card a itself is picked out and reinserted at a uniformly 
random position U = Un & Qn', this is stage 2. In the third and final stage, the 
black cards are picked out and reinserted. If card a was reinserted in the white 
layer at the top, then card a will move gradually down the deck during the whole 
of this stage, whereas if not, then stage 3 divides into the two sub-stages where in 
the first of these, stage 3a, the black cards above card a are reinserted and a moves 
upwards and in the second, stage 3b, the black cards below card a are reinserted 
and a moves down the deck. 

Even though we will not need the exact distribution of where card a ends up 
under this procedure, we will still need a good approximate control. The following 
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two lemmas will be useful for that. 


Lemma 2.1 Let the sequence Yq,Yi, , Fn(i_a) be recursively defined by Yq = 
a G Qn and Yt+i = Yt + 1/n with probability Yt and Yt+i = Yt with probability 
1 — Yt (where these events are conditionally independent ofYo, Fi,..., Yt-i given 
Yt). Then 


E|y.i = (i + “ 


and 


2t 


ly 


Var(y;«)<4E 1 + - -F 1 + - 

n \ n J n ' ^ 


2t 


In particular, for all t, 


VarfFt) < -n 
5 


Proof. By conditioning on Yt we get that 


E[F,+i] = E 


Yt{Yt + -]+il-Yt)Yt 
n 


= E 

+ 

_1 


LV nj J 


which proves the expression for the expectation. For the variance part, write Vt := 
Var(l^). Then no = 0 and recursively 

Var(Fi+i) = E[Var(Ft+i|Fi)] + Var(E[Fi+i|Ft]). 

B y definition of the Ft’s,Var(Ft+i I Ft) = Ft(l—Ft)/n^ andby the above E[Ft+i|Ft] 
(1 + l/n)Yt. For for first term we have 








(1 + l/n)% 1 ( , 2 , t , - 

-5- y \ Vt Y a IlH— 

\ \ n 


2V 


Adding the second term and writing c := 1 + 1/n gives 


vt+i = nt + ^ (l - c*a) 

\ n^/ 

< c^Vt H—5-c*a- . 


( 1 ) 
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This recursion is readily solved and gives 


2t 

a 


vt+i < ^ > y - 


n 


2 

j=t 




By Q, Vt is increasing in t, so we get an upper bound on plugging in t = n{l — a) 
on the right hand side and then get 


^'^+l < 

< 


^ (ei-2- _ e- 
n ' 


a(l 



where the second inequality from standard optimization over a. 


□ 


Lemma 2.2 Let X G L^(M) be a random variable and / : M — )■ M contractive, 

i.e. \f{x) — f{y)\ < |a; — y\for all x, y G M. Then 

Var(/(X)) < Var(X). 

Proof. Let Xi and X 2 be two independent copies of X. Then 

Var(X) = ivar(Xi-A'2) 

= ^El\X,-X,\^] 

> ^El\f{X,)-f(X,)\-^] 

= Var(/(A)). 


□ 


Let Z = be the position that card a = On & Qn ends up in after one round 
of shuffling. For each b G [0,1], define Gf, : [0,1] —)■ [0,1] as 


Gbiu) 


’^u, u < uoih) := 1 — (1 — b)e^ 

gE '’(!-«) — (1 — u > uo{b) 


Note that Gb{u) is continuous in {b,u) and for each b, Gb is differentiable for 
u 7 ^ uo{b). See Figure [^to see a plot of Gb for a few different b. The functions Gb 
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Figure 1: The function Ghiu) for 6 = 0.3, 0.5, 0.7, 0.9. The smaller the h, the 
larger the ascent at the origin. 


play a central role in the following lemma, which gives control over the asymptotic 
distribution, expectation and variance of Z given U. The limiting distribution 
is known and due to Pinsky, see Theorem 4 of [|5l. Since we will also need a 
quantified bound on the variance, we will for self containedness, reprove the result 
below. 

Lemma 2.3 For all u G Qn, 

E[Z\U = n] G (^1 ± Ga{u). (3) 

and 

Var(Z|t/ = u) < 9n“^, 
or equivalently, writing Z = E[Z\U] + D, 

Var(i7) < 9n“^ (4) 

In particular, ifon^a^ [0; 1]> ^hen 

Zn 4 Ga{V) 

where V is uniform on [0,1]. 
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Proof. Let W = Wn G Qn be such that nW is the number of white cards 
that go to the top layer of white cards in stage 1. We will start by estimating the 
variance of Z given W = w and [/ = u. If m < tu, so that stage 2 moves card a to 


the top white layer, then by Lemma 2.1 


E[Z\U = u,W = w] = (l + - 

\ n 

and 

VarfZlf/ = M, VL = tu) < -n~^. 

5 

The case u > w takes some more work. In order to not overly burden the notation, 
we will until further notice, point out the conditioning onU = u and/or W = w 
by writing indexes u and/or w at the conditional expectations and variances. 

Let S' G Qn be such that nS is the number of black cards in positions w + 
1/n,..., M — 1/n; these are the black cards that get reinserted in stage 3a. Let 
nN be the number of cards below card a after these nS black cards have been 
reinserted. Note that when the first part of stage 3 starts, then card a is in m and at 
that point, the number of black cards below a is n(l — a — S'); these are the ones 
that will get reinserted in the stage 3b. Hence by Lemma [ZT| 


) n(l—a) 

u 


f 1 \ n{l-a-s) 

En,n,[Z|iV,S] = (l + -) (1-^) 

EuAN\S] = (l + ^) (!-“)■ 


and 


Hence 


= (l-En,n,[iV|S]) 


\ n{l—a—S) 


I \ n(l—a—S') / I \ n(l—a) 

= I1 + - - l + Q 


It follows that 

[Z] = E^ 


1 

1 H— 
n 


n{l—a—S) 


n 


1 

1 H— 
n 


n{l—a) 


(l-w). (5) 
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We also get that 

Var„,^(Z|^) = E„,^[Var„,^(Z|iV,5)|^]+Var„,^(E„,^[Z|iV,^]|^) 


< -n + Var„ ^ 
5 


1 

1 H— 
n 


n{l—a—S) 


< ^(l + e^)n“\ 

5 


\ 2n{l—a—S) 2 


-n 

5 


(1-iV) 




Therefore 


Var„,^(Z) = E„,^ [Var„,^(Z|S')] + Var„,^ (E„,^[Z|S']) 


< 5(1 + ^ + Var„,^ 


\ n{l—a—S)' 

1 + - 


n 


< -(l + e^)n ^+Var„^(eS') 

5 


< 


2 13 . 

5 ^ 


n 


by Lemma 2.2, sinee the map S —>■ (l/e)(l + is eontraetive and nS, 

given U = u and W = w, is hypergeometrie with varianee at most n/4. 

Now bring baek the eonditioning on W into ordinary notation. What we have 
just shown is among other things, that Var„(Z|lL) < Cn with C := 2/5 + 
13eV20. Thus Var„(Z) = E„[Var„(Z|lL)]+Var„(E„[Z|lL]) < C'n+Var„(E„[Z|lL]). 
However, by Q 


|E„ [Z\W = w]-E^[Z\W = w-l/n]\ 



/ I \ n{l—a—S) 



/ I \ n(l—a—S') 


E„ 

1 

+ 

W = w 

1 

s 

1 

+ 
g 1 ^ 

W = w — 1/n 


< E„ 


1 

1 H— 
n 


n{l—a—S) 


1 

1 H— 
n 


W = w 


e 

< 

n 


where the first inequality uses that the eonditional distributions of nS given W = 
w and W = w — 1/n respeetively, ean easily be eoupled so that the realizations 
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do not differ by more than 1. It now follows that 


Var„(E„[Z|ll^]) < e'^NoiuiW) < 

0 

where the seeond inequality follows from Lemma [ZT| Henee 

Var„(Z) < ^ < 9^”^- 

This allows us to write Z = E[Z|[/] + D, where D = Z — E[Z|f/] has 
Var(L)) = E [Var(L)|L)] = E [Var(Z|L)] < 9n-\ 


This finishes the proof of the varianee part of the lemma. 

Next let n —)■ oo, eonsidering for eaeh n eard a„, where an ^ a e [0,1]. 
Then, by Lemma 2.L E[1 — Wn] —>■ (1 — a)e“ so sinee Var(114) < 2n~^l5 —)■ 0, 
Wn eonverges in probability to uo{a) = 1 — (1 — a)e“. Given Un = Un ^ u 
and Wn = Wn ^ uo{a), we get by the above that for the ease u < uo{a), so 

that Un < Wn eventually, that A e^~°'u. For u > uo{a), so that Un > Wn 
eventually, nS is hypergeometrie and has expeetation n(M„—Wn) (1— a„)/ (l—Wn). 
Plugging in the limit uo{a) of Wn, it follows that Sn eonverges in probability to 
1 — a — e““(l — u). Plugging this into (|^ together with Q and the faet that 

p 

Wn —>■ uo{a), gives that eonditionally on L = Un ^ u with Un > Wn, 


P e 

-)■ e 


'(i-o') _ n - 


(1 — u)e 


1—a 


Summing up, we get that the position of a eard starting from position a„ G Qn, 
an a, after one round of CCRR shuffling eonverges in distribution to that of 
Ga{V) where V is uniform on [0,1], as desired. Also, taking /r := E„^^[S'], we 
have, sinee Var„_^(S') < n~^/A, 

E.,^[e-^] = G e-^(l, 1 + Var(S)) C ± • 

Plugging this into Q , letting n ^ oo and again using the eonvergenee of Wn 
gives @. □ 

Reeall that we write A = A{n) for the transition matrix of the movement 
of a eard under one round of CCRR. Write B = B{n) = [6^^] for the tran¬ 
sition matrix of a eard that moves aeeording to Ga{V). More preeisely, let V 


10 






be uniform on [0,1] and let bij be the probability that GaiV) E {j — 1/n, j), 
i,j E {1/n, 2/n,..., 1}, where a is ehosen uniformly at random in (i — 1/n, i). 
The preeise definition of B is taken so that the stationary distribution under B is 
uniform. In partieular B is doubly stoehastie and B^ is the transition matrix of 
the reversed Markov ehain. 

The next lemma states that the matrix B has a nontrivial eigenvalue bounded 
away from 0 . 

Lemma 2.4 The transition matrix B{n) has a (possibly complex) second eigen¬ 
value A such that |A| > 0.08. 

Remark. Matlab evaluations up to n = 10^ strongly suggest that the seeond 
eigenvalue is real and in the interval ( 0 . 21 , 0 . 22 ). 

Proof. Write B = S -E D where S is the symmetrie matrix (B + B^)/2 and 
D is the skew-symmetrie matrix (B — B'^)/2. We elaim the following. 

Lemma 2.5 The second largest eigenvalue of S is at least 0.21 


Lemma 2.6 The (purely imaginary) eigenvalues X of D satisfy |A| < 0.13. In 
particular, the L 2 g-norm of D satisfies ||I1||2,2 < 0.13. 


The usefulness of Lemmas and 2^ and a strategy for proving them, fol¬ 
low from the following faets on stability of eigenvalues, i.e. what ean happen to 
the spectrum of a matrix under perturbations. These results and their elementary 
proofs can be found e.g. at flTl. Recall that a square matrix C is said to be normal 
if CC^ = C^C and note that S and D are both normal. 


Lemma 2.7 Let C be a normal n x n matrix. Suppose that C has an eigenvalue 
Ao and that E is any nxn matrix with ||T^|| 2,2 < Then there exists an eigenvalue 
A ofC + E such that |A — Ao| < e. 

Moreover if X is a complex number such that there exists a vector f such that 
||C0 — A 0 II 2 < e, then C has an eigenvalue Aq with |A — Ao| < e. 


Hence Lemma 2.4 follows immediately from Lemmas 2.5 and |2.6| together 
with Lemma im □ 


Proof of Lemma 2.5 and Lemma 2.6 In the proof of these lemmas, it will be 


convenient to use the following convention: when a function / is defined on 
{l/n, 2 /n,...,l}we will identify it with its extension to [ 0 , 1 ] defined by /(a) = 
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f{n~^ \na?\). By this convention, ||/||2 of the unextended n-dimensional veetor / 
is ^/n times ||/||2 of the extended / as a funetion in L^[0,1]. 

Let us first study S. That (A, 0) is an eigenvalue/eigenveetor pair for S means 
that E[0(Xi)|Xo = a] = A0(a) for all a = 1/n, 2/n,..., 1, where Xi = Xi{n) is 
the position of a eard after one move aeeording to S, starting from Xq. Write Y = 
Y (n) for a random variable distributed as the position after one move aeeording 
to B{n) and let Y*{n) be distributed aeeording to the position after one step of 
(Reeall that B{n) is doubly stoehastie, so that B{n)^ is the transition 
matrix of the reversed CCRR.) Thus Xi is the (uniform) eonvex eombination of Y 
and Y*. The idea now is to find elose enough to an eigenvalue/eigenveetor 

pair to allow us to draw the desired eonelusion from Lemma 2.7[ We do this with 
the aid of Matlab. Some more details on the Matlab eomputations, in partieular 
the eode, ean be found in the appendix. 

We use Matlab to eompute the eigenvalue k = 0.2293... and eorresponding 
eigenveetor x with n = 10^, sealed so that Hxlh = 1- Next let n = 10^ and 
extend x to -0, the linear interpolation of (a slightly smoothed out version (see the 
appendix) of) x- Then we find that 


||E[V^(Xi(n))|Xo(n) = ■] - /^V’(-))ll2 < 0.0012. 


( 6 ) 


To arrive at the desired eonelusion, a good uniform bound on the norm of the 
differenee between E['0(Xi(m)) IXo = ■] and E['0(Xi(?7,))|Xo = ■] for m > n = 

10^ will also be established. The idea is to show that the total variation norm of the 
differenee between the distributions of Ga{V) and Ga+i/m{y) for arbitrary a G 
(0,1), is small. Then this bound will be used to infer the existenee of a eoupling 
of two random variables distributed aeeording to these, sueh that 
F{Ya ^ Ya+i/m) Small. This together with the faet that ||V’||oo is not too large will 
then establish the desired bound. 

Note that the distribution funetion of Ga{V) is G~^ and the density is {d/dx)G~^{x). 
Reeall from Q that uo{a) = 1 — {1 — a)e°‘ is the breakpoint in the expression for 

Gain). 

Claim. We have 


\\Ga{V) - Ga+i/UV)\\TV = raax\G^\x) - G^l^/^{x)\ 

and the differenee |G“^(a;) — maximized when either x = xq := 

Ga+i/m{uo{a + 1/m)) or a; = Ga{uQ{a)). 

Proof of claim. Write b ;= a + 1/m. To prove the elaim, it suffiees to show 

that {d/ dx){G~^{x) —G^^{x)) is negative for x < Ga{uQ{a)) andx > Gb{uQ{h)) 
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and positive for Ga{uo{a)) < x < Gb{uo{b)). This is equivalent to showing that 
G'aiG~^{x)) — is positive for a; < Ga{uo{a)) and a; > Gb{uo{b)) and 

negative for x between the two bounds. The derivative of Ga is given by 


J u < uo{a) 

\ e^-“ - u > uola) 


(7) 


Note that G" > 0 on {uo{a), 1] so that G'^ is non-decreasing on {uq^o), 1]. (This 
is obviously true on [0, Uo(a)) as well, but we will not need that here.) For x < 
Ga(uo(a-)), the difference of the derivatives is constantly > 0. 

When Ga(uo(a)) <x < Gb(uo(b)), G'i,(G^\x)) = e^"^ whereas G'JG-^(x)) < 
— 1, since G'^ is increasing, which is obviously smaller. 

For X > Gb(uo(b)), let 2 : := G“^(a;) and y := G'^^{x). Then, since G'^ is 
increasing, we have 

^ Gb{z) - Ga{z) 

" G',{z) • 

We want to bound this from below. We have 




Since b > 1/m, 


Hence 





m 

Gb{z)-Ga{z) 

ge-*'(l-z) _ gl-6 _ ge-“(l-z) _|_ gl- 

G'biz) 

gl-b _ ^e-*>(l-z) 

> 

^-igi-b _ — z)e^ *’(1-^) 


gl-b _ Qe-*>{l-z) 

> 

m~^{l — z){e^~'^ — e® 


gl-b _ ^e->’{l-z) 

= 

1 

1 

0 
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Therefore 1—y = l — z + z — y> (1 + l/m)(l — z), so 



G'^{y) = + e" - e" 

> ('ge-'>(i+i/m)-e-)(i-.) _ ^ 

> m-^e^-b + e^-“(i--) ('em-e-(i-.) _ 

> — 2m“^ > 0, 


This proves the elaim. 

Now we have that 

0 < G:l,^JGa{no{a))) - G-\G^{uo{a))) 
= - e“-i)G,K(a)) 

< e-i/-(gi/m _ i)G,(„o(a)) 

< m~^. 


□ 


Also 

0 < G-\xo) - G-ly^{xo) = G-\xo) - Uo{a + 1/m). 

Now Gaiuo{a)) = e^““ — e(l — a) from whieh it follows that 

xo-Ga{uo{a)) = Ga+i/m{uo{a +1/m)) - Ga{uo{a)) 

— gl-a _ gl-a-l/m 

m 

< -il-e-^-^/^) 
m ^ 

2ea 

< - . 

m 

The derivative of Ga was given above in Q and is minimized as u I uo{a) and 
then tends to e^““ — > a/2. Since {d/dx)G~^{x) = l/G/{G~^{x)), it 

follows that 

X 

G~\xo) < uo{a) +-^{xq-G aiuoia))) 

< Mo(a)+4em“^. 
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Since = UQ{a + l/m) > Mo(a), it follows that 

G'-^(xo) - G-ly^{xo) < Aem-^ < llm-^. 

Equivalently, for aG {l/?7,,2/?7,, — l/n} and all I, 

i i 

i=i i=i 

< ( 8 ) 


This means that the total variation distanee between the distributions of two eards 
making a move aeeording to B, starting from i and i + 1/m respeetively, is 
bounded by ll/m. Writing Ya{m) for a random variable distributed aeeording 
to the position after one round of CCRR for a eard that starts in position a, a eon- 
sequenee of this is that one ean eonstruet a eoupling of T/(m) and Ya+i/mi'm) 
sueh that ¥{Ya{m) ^ Ya+i/m{^)) < ll/m. More generally, for k < m, one 
ean eouple so that P(Ya(m) ^ ka+fc/m(m)) < llk/m. This entails, with -0 := 
maxx'i/^x) — min^; t/l^x) < 4.5, that 

|E[V'(y;(m))] - El'll){Ya+k/m{m))] \ < ll'ipkm-^ < 50km~^. (9) 

Next we give a bound eorresponding to (|^ for . Note that G'^{j) = G/{j + 
1/m) for a sueh that j + 1/m < Mo(a) and that when j > Uo{a), G/{j) < 
G/{j + 1/m), whereas when j < uoia) <j + 1/m, then G/{j) > G/{j + 1/m). 
Henee hj+i/m — is zero for Uo{i) > b + 1/m, negative for Uo{i) < b and 
positive for the i’s sueh that j < uo{i) < j + 1/m. Henee the sum 


T := 



{h 




i:j<UQ{i)<j+l/m 



gives the total variation distanee between the distributions of two eards making 
one move aeeording to B^ and starting from j and j + 1/m respeetively. The 
number of Ts in the sum equals at most m{uQ^{j + 1/m) — Mq ^(j)) + 1 and 


biJ-\-\lm — 


1 A 


m(G'i)'+(Mo(*)) 


< 1 A 


2 

mi 


where the seeond inequality follows from the bound (Gi) + (M) > i/2 from above. 
Eaeh of the i’s in the sum T is an i sueh that j < uo{i), i.e. i > Mq ^(j). Henee T 
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is bounded by 2 (mq ^(j + 1/m) — Uq ^(j) + l/m)/uQ ^(j). Now Mo(a) = ae“, so 
by the Mean Value Theorem, for some a > Uo\j), 


Uo\j + 1/m) - Mo ^(j) 


e-“ 1 

< -< — < 

ma ma 


1 

Wo^(j)’ 


Henee T is bounded by 1 A 2/muQ^{j)'^. Sinee Mo(a) = 1 — (1 — a)e“ < a^, we 

have Mo ^(j) > ^ 

T < 1 A —. 
mj 

Now, in analogy with the above, let Y*{m) be distributed as the position of a 
eard after one move aeeording to B^, started from a. Then one ean eonstruet a 
eoupling sueh that F{Y*{m) ^ ^a+k/mi^)) < 1 2k/ma and henee 


|E[^(V;(m))] - E[^(V;+,/^(m))]| < ^ 



< 4.5 



( 10 ) 


Now eompare E['0(y/(m)] and E,[ijj{Ya{n))]. For eonvenienee, assume that n|m 
and set m = nl. For a = k/n — r/m, 0 < r < Z — 1, we have by eonvention that 
E,[ijj{Ya{n)] = E['^(y)jQ(n))], where Oo ■.= n~^\na] = (/c + 1 )/m. Then Q shows 
that 

50 

|E[V>(Va(m))] -E[V^(y,„(m))| < - = 0.0005. 


n 


From our Matlab ealeulations, we get max^; I < 100. Then it is elear that 


|E[V'(y„(m))] - E|V)(F„(n))]| < ™ = 0.001. 


Henee 


\\E[i/j{Y{m))] - E[V’(V(n ))]||2 < 0.0015. (11) 

Analogously for eomparing E[ip(Y*{m)] with E[ip(Y*{n))], use ( [T^ to get 

2r 


|E[.A(i;-(m))l - E[,A(y-(m))l| < 4.5 1 A 




and henee some straightforward ealeulations give, using (10), that < (Z + 

1)^/3 and that 1/Zc^ = vr^/O, 


l|E[^(y:*(m))]-E[^(y:*(M))]||2 


< 


< 


0.001 + 
0.001 + 


2-4.5 r 5 1 

10 

^ < 0.033. 

Yn 
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Since Xi is the eonvex eombination of Y and Y*, it follows from ( [TT] ) and ( [T^ 
that 

E['i/j{X.{m))] - E[^/;(X(n ))]||2 < 0.018. (13) 

Combining ( [T3] ) with Q, we find that 

||E[-^(X(m))] — mp\\2 < 0.0192 

for all m > 10^. From this it follows that S has an eigenvalue A with A > 
K — 0.0192 > 0.21 as desired. 

Next we prove Lemma [2^ in a eompletely analogous way. We have that (A, 0) 
is an eigenvalue/eigenveetor pair for D if (l/2)(E[0j(y)] — E[0(y.*)]) = A0(i) 
for all i, where Yi and Y* are, as above, random variables distributed aeeording 
one step of B and respeetively, starting from Xq = i. Again we take n = 10^ 
and use Matlab to get k and ^jJ elose to an eigenvalue and eigenveetor respee¬ 
tively. It turns out that k = 0.0793...Z, so |fi:| < 0.08 and we get -0 < 5. In 
terms of variability however, this ease turns out to be less well behaved. We get 
maxa; |'0'(a;)| < 400 and 


-(E[0(K(n))]-E[0(r(n))])-/s:0(- 


Then the above ealeulations now give 


- (E[0(K(m))] - E[0(X*(m))]) - 


< 0.017. 


< 0.047 < 0.05. 


The desired result follows now follows from Lemma [2771 


□ 


For the remainder of the paper, in the light of Lemma 2.4 we fix A to be 
the eigenvalue of B with the seeond largest modulus. Let 0 be an eigenveetor 
eorresponding to A with ||0||2 = 1. Note that sinee A may be eomplex, so may 
0. (However, as remarked before, Matlab eomputations up to n = 10^ strongly 
suggest that A is real.) The next lemma, whieh we extraet from ([^ in the proof of 
Lemma [23| will be useful in order to show that 0(f) and 0(j) eannot differ mueh 
if i and j are elose. 


Lemma 2.8 Let f : Qn —)■ C and for i E Qn, let Xi be a random variable 
distributed according to the law the position of card i after one move according 
to B. Then for all i E Qn \ {!}. 
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|El/(^i+./„)|-E|/W)]| < -11/11=0. 
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Lemma 2.9 For the eigenvector 0, of B, we have 

II 0 II 1 > 

II0IIOO < 

and 

+ 1 /n) — 0 (z)| < 

/or constants Ci, C 2 onJ C 3 independent ofn and i. 


Proof. Let, as in Lemma 2.8[ Xi be distributed as the position of eard i after 
one move aeeording to -B. By definition of eigenvalue/eigenveetor, E[0(Xj)] = 
Henee by Lemmas 2.8 and 2.4, 


, / M 22|A|-i 275 

//) - /(z + 1 /n) < -< - 

n n 


since ||0||oo 
follows that 


< 1. Write II011 00 = 100n-“ 


1>II</>II2> 



. Since |0(z) — 0(z + l/n)| < 275/n, it 


100n^~^ 

275 

1 


which entails that a > 1/3. This however means that ||0| loo < lOOn so that 
the conclusion from Lemmas [2 .8 1 and [2)4| above can be strengthened to 


./M 27500 

| 0 (z) - 0 (z + l/n)| < 


Now writing ||0||oo = lOOn ^ gives that 


1 > 11011 ^ > 


27500^ 

n8/3 


27500 

E 

i=i 


j > n 


4 / 3-36 


SO that b > 4/9. This shows that ||0||oo < 100n“^/®. Once again bootstrapping 
the bound on \4>{i) — (p{i + l/n)| gives an upper bound of 2750000n“^^/®. Since 
11011 00 > n“^/^, it follows that 


2 75000 0 

II011 1 = 2750000n-^3/9 ^ j = n^/75500000. 

/=i 
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□ 


Let St := X]rSR,/>(i)>o where XI is the position of eard i after t rounds 

of CCRR. The random variable St is going to be the test statistie used to verify 
that order log n rounds are neeessary for the deek to mix. Let X^o be the deek at 
stationarity (i.e. uniform on S„) and let S^o = 4’i^lo)- Note that l^ol > 

for a eonstant C\ independent of n by Lemma 2.9 Sinee the eards now 
move aeeording to A and not B, (j) is not quite an eigenveetor for the motion of a 
eard. However, letting be the position of a eard after t steps aeeording to B and 
eoupling X\ and Yl by using the same uniform random variable V for updating 
(for X^ this is to say that we use n~^ \nV'\), ([^ gives that |IE[2fJ] — ]E[Y^*] | < Ajn. 
Hence by Lemma [Z9| 

= a] - m(Yl)\Yi = all < 

for a constant C 2 independent of n. Hence summing over i with 3fJ0(Xg) > 0 and 
using the triangle inequality gives 

|E[^i|Xo] - A5o| < C 2 n-^/^. 

A straightforward recursion gives, using Lemma [Z9| 


't-i 


E[|5t|] > |A|*5o - |Ar > CslAI 


(14) 


V r=0 


for constants C 3 and C 4 independent of n. 

We also need to bound the variance of St. Let fi{Ui) = E,[Xl\Ui\, where Ut 
is the position where card i is reinserted in round 1. Then we can write X\ = 
/i(f/i)+ei, whereE[e2] < Cg/nbyQ. Hence 0(Xi) = 0(/i(f/i)) + 5^, where the 


variance of 5i is bounded by C^n ^ ■ (n = C^n since by Lemma 2.9 
|(5i| < C3n"'^/®|ei|. 

Now observe that fi{Ui) and fj{Uj) are independent and Cov(5j, 6 j) < 

Also, for all u and v. 


\E[ej\Ui = u] - E[ej\Ui = n]| < —. 


n 


(15) 


To see this, couple the motion of card j under Ut = u with the motion of j under 
Ui = V hy using the same Uk for all /c 7 ^ L If z < j, then after card j has been 
reinserted (at the same position in the two decks), the number of cards of those 
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that remain to be reinserted that are below j, will differ by at most one between 
the two deeks. The same goes in the ease i > j after i is reinserted. Now use 
Lemma [2T1 
We get 


< 

< 


Cov(0(/,(f/,)),E[5,-|f/,]) 

Var((/.(/,(f/,)))V2Var(E[<5,|f/,])V2 


where the seeond inequality uses ( [T5] ) to bound the seeond faetor. Summing up, 
we get 

Cov(0(X,), 0(X,)) < (16) 

from whieh it follows that 

Var(^i) < (17) 


From the eonsiderations leading up to ( [14] ), we ean write E[S't+i|Xf] = \St + Z 
for a random variable Z, whieh is fune tion of St sueh that \Z\< C 2 n Henee, 
sinee E[|S't|] < by Lemma 2^ 

Var(E[^t+i|Xi]) < |A|2Var(^i) + 2C'2|A|E[|5t|]n-"/9 + 

< |A|2Var(^i) + 2C'2|A|ni/® + 


Here, the mid term follows from Cov( S'*, Z) < E[|S'i||Z|]. By (17), max^; E[Var(S' 4 +i|Xt 
x)] < Henee, with ft := Var(S't)and using that Var(S't+i) = E[Var(S't+i|Xt)] + 

Var(E[S't+i|Xt]), we have the reeursive inequality, 

vt+i < |Apft + Cion^^^ 


with fo = 0. It follows that 

j=0 

By eontinuity we also get Var(S'oo) < C'nn^/®. 

Finally let r := [logn/9 log |A|“^J. Then by ([T^, E[|5 't|] > Ci 2 n^/®, so by 
Chebyshev’s inequality, 

mSr\ < 0 
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as n —)■ oo, whereas, sinee E[S'oo] = 0, 


P(|^oo|<n2/9)^l. 

This proves the main theorem. 
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3 Appendix 


For the Matlab computations, we have used three functions, rimatris, riprod 


and riprod2. Recall from Lemmas 2.4 2.5 and 2.6 the transition matrix B{n) 
for which there was established that the second eigenvalue has modulus at least 
0.08, via considerations of approximate eigenvalues and eigenvectors for the ma¬ 
trices S'(n) = {B{n) + B{nY)D{n) = {B{n) — B{n)^)/2. The command 
rimatris (n) produces B{n). The two other functions take an n-dimensional 
vector V as input and return S'(n)v and D{n)\ respectively. Since we needed n to 
be as large as 10^, computation time was an important issue. Therefore the code 
has been optimized for computational speed and it is not quite as straightforward 
as one would at first believe on knowing B{n). Here is the code. 


function A=rimatris(n) 


A=zeros(n,n); 

r=zeros(1,n+1) ; 

e=exp (1); 

ep=l/n; 

a=0; 

ea=l; 

ema=l; 

eema=exp(1); 
eep=exp (ep); 
emep=l/eep; 

for i=l:n, 
a=a+ep; 
ea=ea*eep; 
ema=ema*emep; 
eema=eemaemep; 
u=0; 

for j=0:n, 
z=j*ep; 

s=min(e*ema*u,eema(1-u)-e*ema*(1-u))-z; 
while abs(s)>le-12, 

1= (u <= l-(l-a)*ea); 
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u=u-s/(I*e*ema + (l-I) * (e*ema-ema*eema"( 1-u) ) ) ; 

s=min(e*ema*u,eema(1-u)-e*ema* (1-u))-z; 

end 

r (j + 1)=u; 
end 

A(i, :)=r(2:n+l)-r(l:n) ; 
end 

function y=riprod(x); 

n=length(x) ; 
y=zeros(1,n); 
e=exp(1); 
ep=l/n; 
z=ep*(0:n); 
u=z; 
a=0; 
ea=l; 
ema=l; 

eema=exp(1); 
eep=exp(ep); 
emep=l/eep; 

for i=l:n, 
a=a+ep; 
ea=ea*eep; 
ema=ema*emep; 
eema=eemaemep; 

s=min (e*ema*u, eema . "(1-u) -e*ema* (1-u) ) - z; 
while max(abs(s))>le-12, 

1= (u <= l-(l-a)*ea); 

u=u-s . / (I *e*ema + (l-I) . * (e*ema-ema*eema . "(1-u) ) ) ; 

s=min(e*ema*u,eema. (1-u)-e*ema* (1-u))-z; 

end 

r=u (2:n+l)-u(l:n) ; 
y(i)=y(i)+r*x; 
y=y+x (i)*r; 
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end 

y=0.5*7'; 

function y=riprod2 (x) ; 

n=length (x); 
y=zeros (1,n); 
e=exp (1); 
ep=l/n; 
z=ep* (0:n); 
u=z; 
a=0; 
ea=l; 
ema=l; 

eema=exp(1); 
eep=exp (ep); 
emep=l/eep; 

for i=l:n, 
a=a+ep; 
ea=ea*eep; 
ema=ema*emep; 
eema=eemaemep; 

s=min (e*ema*u, eema . "(1-u) -e*ema* (1-u) ) - z; 
while max(abs(s))>le-12, 

1= (u <= l-(l-a)*ea); 

u=u-s./(I*e*ema + (l-I).*(e*ema-ema*eema.(1-u))); 

s=min(e*ema*u,eema.(1-u)-e*ema*(1-u))-z; 

end 

r=u (2:n+l)-u(l:n) ; 
y(i)=y(i)+r*x; 
y=y-x (i)*r; 
end 

y=0.5*y'' ; 

Given these funetions, they have been used with the following set of eom- 
mands. 

A=rimatris (10001); 
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B=(A+A')/2; 

C=(A-A')/2; 

[u,1]=eigs(B,2); 
u=u ( :,2); 
u=100*u; 

1=1 (2,2); 

[w,k]=eigs (C,1); w=100*w; 

for 1=2:25, u (26-1)=u (26)-1*(u(26)-u(25));, end 

for 1=2:75, w(76-1)=w(76)-1* (w(76)-w(75));, end 

du=10000*(u(2:10001)-u(1:10000)); 

dw=10000*(u(2:10001)-w(1:10000)); 

x=0:10000; 

xx=0:0.1:10000; 

y=interpl(x,u,xx) ; 

y=y' ; 

z = interpl(x,w,xx) ; 
z=conj(z') ; 
r=riprod(y)-l*y; 
s=riprod2(z)-k*z; 
sqrt(r'*r/100000) ; 
sqrt(s'*s/100000) ; 
max(abs(du)); 
max(abs(dw)); 

Then u and w are first the normalized eigenvectors of B{n) and D{n) respec¬ 
tively for n = 10"^. These are then smoothed out, whereupon y and 2 : are the linear 
interpolations of the smoothed-out vectors. The commands max {abs (du) ) and 
max{abs{dw)) give 4 > in the respective cases. 
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